Text Mining for News and Blogs Analysis
نویسنده
چکیده
News and blogs are two types of media that generate and offer informational resources. News is any information whose revelation is anticipated to have an intellectual or actionable impact on the recipient. The dominant type of news in text analysis is that pertaining to current events. Originally referring to print-based news from press agencies or end-user news providers (like individual newspapers or serials), it now increasingly refers to Web-based news in the online editions of the same providers or in online-only news media. The term is generally understood to denote only the reports in news media, not opinion or comment pieces. A blog is a (more or less) frequently updated publication on the Web, sorted in (usually reverse) chronological order of the constituent blog posts. The content may reflect any interest including personal, journalistic, or corporate. Blogs were originally called weblogs. To avoid confusion with web server log files that are also known by this term, the abbreviation “blog” was coined and is now commonly used.
منابع مشابه
Discovering and Tracking Events From News, Blogs and Microblogs on the Web
Using three data sources, news, blogs, and microblogs, this study proposes a framework for discovering and tracking events embedded in free form online text. Existing methods for text mining are discussed for the three sources. Because three sources have different perspective, event analysis, region-topic model and rare keywords are proposed respectively. In order to integrate three data source...
متن کاملCoreference Resolution on Blogs and Commented News
We focus on automatic coreference resolution for blogs and news articles with user comments as part of a project on opinion mining. We aim to study the effect of the genre shift from edited structured newspaper text to unedited, unstructured blog data. We compare our coreference resolution system on three data sets: newspaper articles, mixed newspaper articles and reader comments, and blog data...
متن کاملAn Overview of Event Extraction from Text
One common application of text mining is event extraction, which encompasses deducing specific knowledge concerning incidents referred to in texts. Event extraction can be applied to various types of written text, e.g., (online) news messages, blogs, and manuscripts. This literature survey reviews text mining techniques that are employed for various event extraction purposes. It provides genera...
متن کاملOn Developing Extraction Rules for Mining Informal Scientific References from Altmetric Data Sources
Altmetrics measure scientific impact outside of traditional scientific literature. We identify mentions of scientific research or entities like researchers, academic or research organizations in a corpus containing blogs, articles, news items etc. We manually analysed the corpus for patterns of such informal mentions and then applied text mining techniques by developing extraction rules for min...
متن کاملLarge-Scale Sentiment Analysis for News and Blogs
Newspapers and blogs express opinion of news entities (people, places, things) while reporting on recent events. We present a system that assigns scores indicating positive or negative opinion to each distinct entity in the text corpus. Our system consists of a sentiment identification phase, which associates expressed opinions with each relevant entity, and a sentiment aggregation and scoring ...
متن کامل